A Clustering Strategy to Find Similarities in Mycoplasma Promoters

Authors

  • João Francisco Valiati
  • Paulo Martins Engel
Abstract

This paper presents a neural network clustering strategy for identifying regularities in a dataset of Mycoplasma promoter sequences. The traditional way of identifying prokaryotic promoters has proven inadequate for the Mycoplasma family. Our clustering approach tries to discover regularities in the base-pair composition of the dataset sequences, giving clues that indicate the presence or absence of promoters. Several experiments using a leave-one-out strategy and a negative dataset revealed the best way to fit the model parameters. Preliminary results are promising for creating a computational model able to find promoter regions in Mycoplasmas. IV BSB. Please see the Symposium Proceedings in Lecture Notes in Bioinformatics (LNBI no. 3594), Springer Verlag, for this paper.


Similar resources

Document Clustering Based on Ontology and a Fuzzy Approach

Data mining, also known as knowledge discovery in databases, is the process of discovering unknown knowledge from a large amount of data. Text mining applies data mining techniques to extract knowledge from unstructured text. Text clustering, the unsupervised classification of similar documents into different groups, is one of the important techniques of text mining. The most important step...


A Survey of Pathogenic Avian Mycoplasma Involvement in Multicausal Respiratory Disease in Broiler Flocks

BACKGROUND: Mycoplasma gallisepticum (MG) and Mycoplasma synoviae (MS) are the most important pathogenic mycoplasmas in chicken production. The tendency of avian mycoplasmas to interact with other pathogens is well known. Interaction among several disease-producing factors in the respiratory tract exacerbates the disease and is known as multicausal respiratory disease. OBJECTIVES: In recent years...


GROUND MOTION CLUSTERING BY A HYBRID K-MEANS AND COLLIDING BODIES OPTIMIZATION

The stochastic nature of earthquakes makes it challenging for engineers to choose which records to use in their analyses. Clustering is offered as a solution to this data mining problem, automatically distinguishing between ground motion records based on similarities in the corresponding seismic attributes. The present work formulates an optimization problem to seek the best clustering measures. In...


A New Approach in Strategy Formulation using Clustering Algorithm: An Instance in a Service Company

The increasingly severe and dynamic competitive environment has made strategic decision making in giant organizations more complex. Strategy formulation is one of the basic processes in achieving long-range goals. With ordinary methods, considering all factors and their significance in accomplishing individual goals is almost impossible. Here, a new approach based on a clustering method is pro...


A Clustering Algorithm for Categorical Data Based on Combining Measures

Clustering is one of the main techniques in data mining. It is a process that partitions a data set into groups so that the data within a cluster are as close to each other as possible and the data in different clusters differ as much as possible. Clustering algorithms are divided into two categories according to the type of data: clustering algorithms for numerical data and clustering algor...



Publication date: 2005